Health informatics sits at the vibrant intersection of medicine, data science, and technology, transforming how we store, analyze, and utilize health information. This rapidly evolving field empowers clinicians and researchers to uncover patterns in patient data, improve diagnostic accuracy, and personalize treatment plans without getting lost in complex databases. By turning raw medical records into actionable insights, these innovations are reshaping the future of healthcare delivery and population health management.

At Gist.Science, we bridge the gap between cutting-edge research and public understanding by curating the latest preprints from medRxiv specifically within this domain. Our team processes every new submission in this category, providing both accessible plain-language explanations and detailed technical summaries to ensure the science is clear for everyone, from policymakers to curious readers. Below are the latest papers in health informatics, freshly distilled and ready for you to explore.

Aggregate benchmark scores obscure patient safety implications of errors across frontier language models

This study demonstrates that aggregate benchmark scores fail to capture critical patient safety risks in frontier language models for healthcare, as significant and unpredictable variations in error directionality, contextual bias, and crisis response across models reveal that overall accuracy alone cannot predict clinical safety.

Linzmayer, R., Ramaswamy, A., Hugo, H., Nadkarni, G., Elhadad, N.2026-03-20📄 health informatics

Joint Longitudinal-Survival Modelling of Patient-Reported Gastrointestinal Symptom Trajectories and Treatment Discontinuation in Irritable Bowel Syndrome: A Prospective Cohort Study from the Canadian Gut Project

This prospective cohort study of 2,847 Canadian IBS patients utilizes joint longitudinal-survival modeling to demonstrate that individual symptom trajectories are dynamically linked to treatment discontinuation, revealing that higher baseline severity and slower rates of symptom improvement significantly increase the risk of stopping therapy.

Thornton, E., Kellerman, J.2026-03-19📄 health informatics

Clinician Experiences with Ambient AI Scribe Technology in Singapore: A Qualitative Study

This qualitative study of 28 clinicians at Singapore's Alexandra Hospital reveals that while ambient AI scribe technology offers significant potential to reduce administrative burden and enhance patient engagement, its successful implementation in Singapore's multilingual healthcare system requires addressing critical challenges related to documentation accuracy, workflow adaptation, and compliance with local privacy regulations.

Shankar, R., Goh, A., Xu, Q.2026-03-19📄 health informatics

OpenScientist: evaluating an open agentic AI co-scientist to accelerate biomedical discovery

The paper introduces OpenScientist, an open-source agentic AI co-scientist that significantly accelerates biomedical discovery by autonomously executing complex data analyses and generating verifiable clinical insights across diverse case studies, reducing tasks that typically take weeks to mere minutes.

Roberts, K. F., Abrams, Z. B., Cappelletti, L., Moqri, M., Heugel, N., Caufield, J. H., Bourdenx, M., Li, Y., Banerjee, J., Foschini, L., Galeano, D., Harris, N. L., Li, M., Ying, K., Melendez, J. A. (…)2026-03-18📄 health informatics

Falsification Testing of Sepsis Prediction Models: Evaluating Independent Biological Signal After Controlling for Care-Process Intensity

This pre-registered falsification study across four clinical datasets demonstrates that while sepsis prediction models at elite academic centers primarily detect genuine biological signals rather than care-process intensity, they reveal a systematic and consequential divergence between clinical sepsis definitions and administrative coding that undermines the validity of regulatory metrics and AI benchmarks relying on the latter.

Dickens, A. R.2026-03-18📄 health informatics

Persistent Proxy Discrimination in HIV Testing Prediction Models: A National Fairness Audit of 386,775 US Adults

This national fairness audit of 386,775 US adults demonstrates that enforcing demographic parity in HIV testing prediction models is inappropriate for differential-burden clinical contexts, as it significantly reduces screening access for high-risk populations and underscores the need for fairness metrics like equalized odds and calibration that align with clinical needs.

Farquhar, H.2026-03-16📄 health informatics